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Abstract 

Pulman has shown that Higher-Order 
Unification (HOU) can be used to model 
the interpretation of focus. In this pa- 
per, we extend the unification-based ap- 
proach to cases which are often seen as 
a test-bed for focus theory: utterances 
with multiple focus operators and second 
occurrence expressions. We then show 
that the resulting analysis favourably 
compares with two prominent theories of 
focus (namely, Rooth's Alternative Se- 
mantics and Krifka's Structured Mean- 
ings theory) in that it correctly gener- 
ates interpretations which these alter- 
native theories cannot yield. Finally, 
we discuss the formal properties of the 
approach and argue that even though 
HOU need not terminate, for the class 
of unification-problems dealt with in this 
paper, HOU avoids this shortcoming and 
is in fact computationally tractable. 



1 Introduction 

In this paper, we argue that Higher-Order Uni- 
fication (HOU) provides a linguistically adequate 
tool for modeling the semantics of focus. Building 



up on (Pulman, 1995), we develop a unification- 
based analysis of focus which we show favourably 
compares with two prominent theories of focus, 
Rooth's Alternative Semantics and Krifka's Struc- 
tured Meanings theory. For data which is gener- 
ally viewed as a test-bed for focus theory (utter- 
ances with multiple focus operators and second 
occurrence expressions), we show that contrary 
to Rooth's and Krifka's theories, the HOU treat- 
ment yields a transparent analysis while avoiding 
under- and over-generation. 



2 Focus theory 



Focus is a much debated notion. In this paper, we 
assume a simplified version of Jackendoff's defini- 
tion: a focus is the semantic value of a prosodi- 
cally prominent element. We take the identifica- 
tion of prosodically prominent elements as given. 

To set the stage for this paper, we will briefly 
review the folklore, i.e. the main issues of fo- 
cus theory. It is commonly agreed that focus 
triggers the formation of an additional seman- 
tic value which we will call the Focus Seman- 
tic Value (FSV). The name and definition of 
the FSV varies from author to author: Jackend- 



off (Jackendoff, 1972) calls it the presuppositional 
set, Rooth (Rooth, 1992) the Alternative Set and 
Krifka (Krifka, 1992) the Ground. In this paper, 
we assume a definition of the FSV which is in 
essence Rooth's Alternative set, that is, the set 
of semantic objects obtained by making an ap- 
propriate substitution in the focus position. For 
instance, the FSV of (la) is defined as (lb), the 
set of properties of the form like-ing y where y is 
an individual (in what follows, focus is indicated 
using upper-case; we also follow Montague's con- 
vention that for any type r, D T is the set of objects 
of type r and wff T is the set of wffs of type r). 

(1) a. Jon only likes MARY 
b. {Xx.l(x,y) | y G D e } 

It is also usually agreed that certain linguis- 
tic elements associate with focus in that the 
meaning of the utterance containing these ele- 
ments varies depending on the choice of focus. For 
instance in (2a-b), the focus operator only asso- 
ciates with focus so that the difference in focus be- 
tween (2a) and (2b) induces a difference in mean- 
ing between the two utterances: in a world where 
Jon introduced Paul to Mary and Sarah, and no 
other introduction takes place, (2a) is necessarily 



false whilst (2b) is true. 

(2) a. Jon only introduced Paul to MARY 
b. Jon only introduced PAUL to Mary 
To model this "association-with-focus" phe- 
nomenon, the semantics of associating-elements 
(e.g. focus operators, quantificational adverbs) is 
made contingent on the FSV which itself, varies 
with the choice of focus. The following example 
illustrates this. Suppose that the meaning of only 
is determined by the following rule: 

[NP only VP] 

^ \/P[P e FSV A P(NP') -> P = VP 1 ] 
where NP' , VP' represent the meaning of NP 
and VP respectively, and FSV stands for the fo- 
cus semantic value of the VP. As we have seen 
above, the FSV of (la) is (lb), hence by the above 
semantic for only, the semantics of (la) is: 
yP[Pe{Xx.l(x,y)\y&D e }AP(j) 

— » P = Xx.l{x, m)\ 
Intuitively, the only property of the form like- 
ing y that holds of Jon is the property of like-ing 
Mary. 

3 The basic analysis 

For computing the Focus Semantic Value, we 
propose to use Higher-Order Unification. More 
specifically, given (part of) an utterance U with 
semantic representation Sem and foci F 1 . . . F n , 
we require that the following equation, the 
ground equation, be solved: 

Sem = Gd(F 1 )...(F n ) 
Assuming the typed A-calculus as our seman- 
tic representation language, this equation can be 
solved by Huet's algorithm (cf. (Huet, 1975)), 



thus assigning a value to Gd. On the basis of this 
value, we can then define the FSV, written Gd, as 
follows: 

Definition 3.1 (Focus Semantic Value) 
Let Gd be of type a — (3/- — > t and n be the number 
of foci (n < k), then the Focus Semantic Value 
derivable from Gd, written Gd, is {Gdit 1 . . . t n ) 

As mentioned before, this yields a focus seman- 
tic value which is in essence Rooth's Alternative 
SetQ. 

1 Though in fact, our definition is more syntactic 
than Rooth. In Rooth's approach, the FSV definition 
is purely semantic whereas in our approach the FSV is 
indirectly defined by solving equations and the value 
thus obtained (i.e. the value of Gd) is a term, that is, 
a syntactic object. Hence, our FSV can be more accu- 
rately compared to Kratzer's presupposition skeleton,. 
This means that our approa ch inherits the advantages 
of Kratzer's approach (cf. (Kratzer, 1991)). In par- 



Finally, we assume as in flPulman, 1995| ), 
that foci are stored and discharged non- 
deterministically as the need arises, thus con- 
tributing to the definition of the ground equation. 
Furthermore, equations are set up at the level at 
which there are needed e.g. at the VP level in the 
case of a pre- verbal focus operator. 

To illustrate the workings of our approach, we 
now run through a simple example. Consider (la). 
To determine the meaning of only likes MARY, 
the FSV of the VP must be known. Hence the 
following equation must be solved: 

\x.l{x,m) — Gd(m) 

By HOU, the value of Gd is theng: 
Gd = Ay\x.l(x, y) 

And by definition (3.1), the FSV is: 

Gd = {Xx.l(x, y) \y £ wff e } 

Assuming the semantic of only given above, the 
semantic representation of (la) is then: 

VP[PE{\xJ(x,y)\ye wff e }AP(j) 
— > P = XxJ(x, m)\ 
In short, we obtain a reading similar to that of 
Rooth, the difference being in the way the FSV is 
determined: by HOU in our approach, by means 
of a semantic definition in Rooth's. 

4 Linguistic applications 

In this section, we show that the HOU approach 
favourably compares with Rooth's and Krifka's 
analysis in that it correctly generates interpreta- 
tions which these two theories fail to yield. As 
we shall see, the main reason for this is that the 
HOU approach makes minimal assumptions about 
the role syntax plays in determining the FSV. In 
particular, it relies neither on the use of Quantifier 
Raising, nor on the assumption of a rule-to-rule 
definition of the FSV. In this way, it avoids some 
of the pitfalls these theories encounter. 



ticular, it adequately captures the interaction of focus 
with VP ellipsis as illustrated by Kratzer's notorious 
example: / only went to TANGLEWOOD because you 
did. 

2 Unification yields another possible value of Gd, 
namely XyXx.l(x,m). In what follows, we assume a 
restriction similar to the DSP's Primary Occur- 
rence Restriction ( Dalrymple et al., 1991 )'s: the 
occurrence directly associated with the focus is a pri- 
mary occurrence and any solution containing a pri- 
mary occurrence is discarded as linguistically invalid. 
For instance, m is a primary occurrence in the equa- 
tion \x.l(x,m) — Gd(m) so that the solution Gd — 
Xy\x.l(x,m) is invalid. For a formal treatment of 
DSP's Primary Occurrence Restriction and a, discus- 
sion of how it can b e extended to focus, see ( |G ardent 
and Kohlhase, 1996|). 



We begin by a brief summary of Rooth's and 
Krifka's theories and stress the properties relevant 
for the present discussion. We then confront the 
three theories with the data. 

4.1 Two alternative theories of focus 

Rooth's Alternative Semantics 

In Rooth's approach, the FSV is defined by re- 
cursion on the truth-conditional structure which 
is itself derived from LF (i.e. Logical Form, the 
Government and Binding level of semantic rep- 
resentation). Focus is then seen as introducing a 
free variable whose value is determined by the cur- 
rent context and is furthermore constrained to be 
an element or a subset of the FSV. For our pur- 
pose, the following characteristics are particularly 
important: 

• Given Rooth's definition of the Alternative 
Set, a focus operator associates with any fo- 
cus occurring in its scope. 

• Any NP may be subject to Quantifier Rais- 
ing. Importantly, this includes focused NPs. 

• Quantifier Raising may not apply to quanti- 
fiers occurring in a scope-island. 

Note that Rooth's approach critically relies on 
quantifier raising as a means of moving a focused 
NP out of the scope of a focus operator. However 
this only applies if the focus NP is not embedded 
in a scope island. 

Krifka's Structured Meanings 

Krifka's approach defines a rule-to-rule seman- 
tics which assigns to any syntactic constituent, a 
meaning which can be either a A-term or a struc- 
tured meaning, i.e. a tuple of the form (Gd, F) 
where Gd is Krifka's Focus Semantic Value and F 
is a (possibly complex) focus. 

For our purpose, an important characteristic of 
Krifka's approach is the tight syntax/semantic in- 
teraction it presupposes. In particular, the theory 
requires that a focus operator combines with a 
syntactic constituent C whose structured seman- 
tics C" = {Gd, F) provides the focus (F) this op- 
erator associates with. In other words, the right- 
adjacent sibling of a focus operator must contain 
all and only the foci this operator associates with. 
As we shall later see, some of the data does not 
seem to square with this assumption. 



4.2 Multiple Focus Operators 

Utterances with multiple focus operators^ are 
known pathological cases of focus theory: 

(3) a. (Jon onlyi read the letters 

that Sarah sent to PAULi) 
b. Jon also2 onlyi read the letters 

that SUE 2 sent to PAULi. 
In the given context, the preferred reading of 
(3b) can be glossed as follows: it is also the case 
for SUE 2 , that Jon onlyi read the letters she sent 
to PAULi - i.e. Jon didn't read the letters she 2 
sent to e.g. Peter. In other words, the preferred 
reading is that also 2 associates with SUE 2 and 
onlyi with PAULi. 

The HOU analysis 

Under the HOU approach, (3b) is analysed as 
follows. First, the meaning of onlyi read the let- 
ters that SUE 2 sent to PAULi is derived. To de- 
termine the FSV of the VP, the ground equation 
(4b) must be solved for which (4c) is a solution. 
Applying the semantics of only given in section |^, 
the semantics of (4a) is then as given in (4d)0. 

(4) a. onlyi read the letters that SUE 2 

sent to PA UL\ 

b. G 1 (p) = Xx.read(x,l(s,p)) 

c. G 1 — Xy.Xx.read(x,l(s,y)) 

d. Xz.VP[P G XyXx.read(x,l(s,y)) A P(z) 

— > P = Xx.read(x,l(s,p))] 
Analysis then proceeds further and the ground 
equation 

G 2 (s) = Xz.yP[ P G XyXx.read(x,l(s,y)) 

AP(z) —fP = Xx.read(x, l(s,p))] 

must be solved to determine the meaning of also 2 
onlyi read the letters that SUE 2 sent to PAULi. 
A possible solution for G 2 is 

Xu.Xx.Xz.\/P[ P G XyXx.read(x,l(u,y)) 

AP(z) — > P = Xx.read(x, l(u,p))} 

Assuming the following semantics for 
NP also VP 

3P[P e FSVAP(NP') AP ^VP'} 

we obtain the desired reading 

3P[ P G XuXx. onlyi read the letters that 
u sent to Pauli 
AP(j) A P =/= Xz.z onlyi read the letters 

that Sue 2 sent to Pauli] 

3 The subscripts indicates which operators associate 
with which focus. There are there for clarity only, and 
have no theoretical import. 

4 For clarity, we have simplified the semantic repre- 
sentation of (3b); nothing hinges on this. 



Comparison with Rooth and Krifka 

under the Alter- 



As mentioned in section 4.1 



native Semantics approach, a focus operator nec- 
essarily associates with any focus occurring in its 
scope. Furthermore in (3b), the scope of onlyi 
is the whole VP read the letters that SUE2 sent 
to PAUL\. Hence, if no quantifier raising occurs, 
onlyi associates with both SUE2 and PAXJL\. 
Thus in order to generate the desired reading, 
SUE2 must be moved out of the scope of onlyi. 
However, since the NP the letters that SUE2 sent 
to PA UL\ is a scope island, quantifier raising is 
impossible. Hence, the desired reading cannot be 
generated^ 

Recall that in the Structured Meanings ap- 
proach, the right-sibling of a focus operator must 
contain all and only the focus this operator as- 
sociates with (cf. section 4.1). Hence, to gener- 
ate the desired reading in (3b), there must exist 
a syntactic constituent which is right-adjacent to 
onlyi and which contains PA UL\ but not S'fT^R; 
similarly, there must exist a syntactic constituent 
which is right-adjacent to also and which contains 
SUE2 but not PAULx. Given standard assump- 
tions about syntax, such constituents do not exist 
so that the desired interpretation cannot be gen- 
erated. 

4.3 Second Occurrence Expressions 

We call second occurrence expressions (SOE) ut- 
terances which partially or completely repeat a 
previous utterance. Typical cases of SOEs are: 
corrections (5a) , echo-sentences (5b) and variants 
(5c). 

Jon only likes MARY. 
No, PETER only likes Mary. 
Jon only likes MARY. 
Huhu, Peter only likes Mary. 
Jon only likes MARY. 
So what? Even PETER only 
likes Mary. 

An important property of SOEs is that the re- 
peated material is deaccented, that is, it is char- 
acterised by an important reduction in pitch, am- 



(5) 



plitude and duration (cf. (Bartels, 1995)). On the 



other hand, all three theories of focus considered 
here are based on the assumption that focus is 
prosodically marked and thus, identifiable. Hence, 



1992) 



This point is independently noted in (Rooth 



^This is a simplification: the constituent may in 
fact contain SUE2 but this focused NP should already 
have been bound by some focus operator so that the 
focus of the whole constituent only includes PAULi. 
Since no focus operators occur in this constituent, it 
follows that such constituent does not exist. 



the question arises of whether these theories can 
account for SOEs. 

The HOU analysis 

Our proposal is to analyse SOEs as involving a 
deaccented anaphor which consists of the repeated 
material, and is subject to the condition that its 
semantic representation must unify with the se- 
mantic representation of its antecedent. 

This is modeled as follows. Let SSem 
and TSem be the semantic representation of 
the source (i.e. antecedent) and target (i.e. 
anaphoric) clause respectively, and TP 1 . . . TP n , 
SP 1 . . . SP n be the target and source parallel el- 
ementsQ then the interpretation of an SOE must 
respect the following equations: 
An(SP 1 ,...,SP n ) = SSem 
An(TP 1 ,...,TP n ) =TSem 

Intuitively, these two equations require that tar- 
get and source clause share a common semantics 
An, the semantics of the deaccented anaphor. 

Given this proposal, the analysis of (5a) involves 
three equations: 

An(j) = yP[P 6 Xy\x.l(x,y) 

A P(J)_-> P = Xx.l(x, to)] 

An(p) = VP[P e ~Gd A P(p) -> P = XxJ(x, to)] 

Gd(F) = Xx.l(x,m) 
Since neither Gd nor Focus are initially given, 
the third equation above is untyped and cannot 
be solved by Huet's algorithm^]. In that situation, 
we can either assume some delaying mechanism 
or some extension of Huet's al gorithm that can 
cope with typ e variables (cf. ( Dougherty, 1993 ; 
Hustadt, 1991 )). Resolution of the first equation 



yields the following solution: 
An = \yVP[P e {Xx.l(x, y)\y€ wff e } 
AP{z) -> P = Xx.l(x, to)] 
By applying An to p, the left-hand side of the 
second equation is then determined so that the 
second equation becomes 

yP[P £ XyXx.l(x,y) A P{p) -> P = Xx.l(x, to)] 
= VP[P 6 ~Gd A P{p) -» P = Xx.like{x, to)] 
and the value of Gd is identified as being 

Gd = XyXx.l(x, y) 

(Note further, that the third equation can now 
be solved thus yielding the value to for the focus 
F.) That is, the HOU approach to SOEs allows 
us to correctly capture that fact that an SOE can 



7 As in (Dalrymple et al., 1991), we take the identi- 
fication of parallel elements as given - for the moment. 

8 Even thoug h this is not explicitly stated, Pul- 
man's analysis ( |Pulman, 1995 , page 6) faces a similar 
problem. 



inherit its FSV from its source clause (by unifica- 
tion). In ( p ardent et al., 1996| ), we show in more 
detail how the analysis accounts for the interac- 
tion of focus with anaphora and definiteness in the 
case of a particular instantiation of SOEs, namely 
corrections. 

Comparison with Rooth and Krifka 

Under the Alternative Semantics approach, 
SOEs are captured as follows. It is assumed that 
the quantification domain of focus operators is a 
variable whose value is contextually determined. 
In the standard case (i.e. the case where the fo- 
cus is prosodically marked), this quantification do- 
main of focus operators is usually identified with 
the FSV of the VP. However, in the SOE cases, 
the assumption is that the quantification domain 
of focus operators is identified with the FSV of 
the source clause. Thus in (5a), the quantifica- 
tion of only in the second clause is identified with 
the FSV of the preceding utterance i.e. the set of 
properties of the form like-ing somebody. 

But now, consider the following example: 
(6) a. Jon only likes MARY. 

b. * No, PETER only likes Sarah. 

Clearly, this dialog is ill-formed in that (6b) 
is no appropriate correction for (6a). However, 
under the Alternative Semantics approach, it will 
not be ruled out since the FSV of (6a) provides 
an appropriate quantification domain for the fo- 
cus operator in (6b): as required by the semantic 
of pre-verbal only, it is a set of properties whose 
elements can be identified with the VP seman- 
tic value Ax.l(x,m). Hence although Rooth's ap- 
proach captures some cases of SOEs, it does not 
seem to provide an adequate characterisation of 
the phenomena at hand. 

The Structured Meanings proposal distingui- 
shes between proper- and quasi-SOFiS. Proper- 
SOEs involve an exact repetition of some previ- 
ous linguistic material, and are analysed as in- 
volving an anaphor which is constrained by the 
restriction that it be a segmental copy of its an- 
tecedent. For instance, the semantics of only likes 
Mary in (5b) is not determined by the semantics 
of its parts but is instead identified with the se- 
mantic value of its antecedent only likes MARY 
in (5a). In contrast, quasi-SOEs only involve 
semantic equivalence between repeating and re- 
peated material (for instance, in a quasi-SOE a 
repeated element may be pronominalised) . Krifka 
claims that quasi-SOEs have prosodically marked 
foci and thus do not raise any specific difficulty. 

However this theory faces a number of method- 
ological and empirical difficulties. First, it is non- 



compositional because the meaning of the deac- 
cented material in proper-SOEs is solely defined 
by the meaning of its antecedent (rather than the 
meaning of its parts). Second, the prosodic data 
is rather unclear: the assumption that quasi-SOE 
contains a prosodically marked focus is a moot 
point (cf. (Bartels, 1995)) and if it proves to 



be false, the analysis fails to account for quasi- 
SOEs. Third, it is counterintuitive in that it han- 
dles separately two classes of data (i.e. quasi- and 
proper-SOEs) which naturally belong together. 
Indeed, the HOU approach can be shown to pro- 
vide a uni form treatment of qu asi- and proper- 
SOEs (cf. ( iGardent et al., 1996|) ). 

5 Formal properties of the HOU 
approach 

The unification problem can be stated as follows: 
Given two terms of a logic M and N, is there 
a substitution, cr, of terms for variables that will 
make the two terms identical (i.e. er(M) = er(N))? 

It is well-known that for Higher-Order Logic 
(e.g. the typed A-calculus) the space of solutions 
can be infinite and furthermore, the HOU prob- 
lem is only semi-decidable so that the unification 
algorithm need not terminate for unsolvable prob- 
lems. 

Fortunately, in our case we are not interested 
in general unification, but we can use the fact 
that our formulae belong to very restricted syn- 
tactic subclasses, for which much better results 
are known. In particular, the fact that free vari- 
ables only occur on the left hand side of our equa- 
tions reduces the problem of finding solutions to 
higher-order matching, of which decidability has 
been p roven for the subclass of third-order for- 
mulae ( Dowck, 1992 ) and is conjectured for the 
general case. This class, (intuitively allowing only 
nesting functions as arguments up to depth two) 
covers all of our examples in this paper. For a 
discussion of other subclasses of formulae, where 
higher-order unification is computationally feasi- 
ble see ([Prehofer, 1994|). 



6 Conclusion 

In this paper, we have argued that Higher-Order 
Unification provides an adequate tool for com- 
puting Focus Semantic Values. To this end, we 
have considered data which is viewed as a test- 
bed for focus theory and shown that, whilst exist- 
ing theories either under-generate, over-generate 
or are methodologically unsatisfactory, the HOU 
approach yields a simple and transparent analysis. 
There appear to be two main reasons for this. 



First, the HOU analysis makes minimal as- 
sumptions about the role syntax is called to play 
in determining the FSV. It is defined on a purely 
semantic level in the sense that unification oper- 
ates on semantic representations, and relies nei- 
ther on quantifier raising, nor on a rule-to-rule 
definition of the FSV. As we have seen, this type 
of approach is a plausible way to avoid under- 
generation. 

Second, the HOU approach permits an equa- 
tional analysis which can naturally be further con- 
strained by additional equations. The interest of 
such an approach was illustrated in our treatment 
of SOEs which we characterise as involving two 
phenomena: the computation of an FSV, and the 
resolution of a deaccented anaphor. Not only did 
we show that this analysis is methodologically and 
empirically sound, we also showed that it finds a 
natural realisation in the equational framework of 
HOU: each linguistic phenomena is characterised 
by some equation(s) and the equations may mu- 
tually constrain each other. For instance, in the 
case of SOEs, we saw that the equations character- 
ising the deaccented anaphor help determine the 
unidentified FSV of the utterance containing the 
unmarked focus. 

Clearly, our approach extends to cases of ad- 
verbial quantification. For lack of space we could 
not develop the theory here; let us just point 
out that von Fintel's criticism ( von Fintcl, 1995] ) 
of semantic approaches to focus, also applies to 
Krifka's Structured Meanings analysis, but not 
to the HOU approach presented here. Von Fin- 
tel points out that in certain cases of adverbial 
quantification, a focus operator associates with 
an unmarked focus and does not associate with 
a marked focus occurring in its scope - as should 
be clear from this article, this is unproblematic for 
our analysis. 

Of course, there are still many open issues. 
First, how does the proposed analysis interact 
with quantification? Second, how does it extend 
to a dynamic semantics (e.g. Discourse Represen- 
tation Theory)? 
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